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Abstract 

Background: SUMOylation, as part of the epigenetic regulation of transcription, has been intensively studied in lower 
eukaryotes that contain only a single SUMO protein; however, the functions of SUMOylation during mammalian 
epigenetic transcriptional regulation are largely uncharacterized. Mammals express three major SUMO paralogues: 
SUMO-1, SUMO-2, and SUMO-3 (normally referred to as SUMO-1 and SUMO-2/3). Herpesviruses, including Kaposi's 
sarcoma associated herpesvirus (KSHV), seem to have evolved mechanisms that directly or indirectly modulate the 
SUMO machinery in order to evade host immune surveillance, thus advancing their survival. Interestingly, KSHV 
encodes a SUMO E3 ligase, K-bZIP, with specificity toward SUMO-2/3 and is an excellent model for investigating the 
global functional differences between SUMO paralogues. 

Results: We investigated the effect of experimental herpesvirus reactivation in a KSHV infected B lymphoma cell line 
on genomic SUMO-1 and SUMO-2/3 binding profiles together with the potential role of chromatin SUMOylation in 
transcription regulation. This was carried out via high-throughput sequencing analysis. Interestingly, chromatin 
immunoprecipitation sequencing (ChlP-seq) experiments showed that KSHV reactivation is accompanied by a significant 
increase in SUMO-2/3 modification around promoter regions, but SUMO-1 enrichment was absent. Expression profiling 
revealed that the SUMO-2/3 targeted genes are primarily highly transcribed genes that show no expression changes 
during viral reactivation. Gene ontology analysis further showed that these genes are involved in cellular immune 
responses and cytokine signaling. High-throughput annotation of SUMO occupancy of transcription factor binding sites 
(TFBS) pinpointed the presence of three master regulators of immune responses, IRF-1, IRF-2, and IRF-7, as potential 
SUMO-2/3 targeted transcriptional factors after KSHV reactivation. 

Conclusion: Our study is the first to identify differential genome-wide SUMO modifications between SUMO paralogues 
during herpesvirus reactivation. Our findings indicate that SUMO-2/3 modification near protein-coding gene promoters 
occurs in order to maintain host immune-related gene unaltered during viral reactivation. 




Genomics 



* Correspondence: pcchangSSym.edu.tw; hkungJSnhri.org.tw 

'institute of Microbiology and Immunology, National Yang-Ming University, 

Taipei 1 1221, Taiwan 

''UC Davis Cancer Center, University of California, Davis, CA 95616, USA 
Full list of author information is available at the end of the article 

O© 201 3 Chang et al.; licensee BioMed Central ttd. This is an open access article distributed under the terms of the Creative 
BIOIVIGCI CGntr3l commons Attribution ticense (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly cited. 



Chang ef al. BMC Genomics 2013, 14:824 
http://www.biomedcentral.com/1471-2164/14/824 



Page 2 of 23 



Background 

SUMOylation was initially identified as a reversible post- 
translational modification that controls a variety of cellu- 
lar processes, including cellular signal transduction, rep- 
lication, chromosome segregation, and DNA repair 
[1-3]. The growing list of Small Ubiquitin-like Modifier 
(SUMO) substrates includes transcription factors and 
epigenetic regulators, which implies the involvement of 
the SUMO modification system in the epigenetic regula- 
tion of gene expression [4] and in the initiation and 
maintaining of heterochromatin silencing [5,6]. SUMO 
has been found in all eukaryotes but is not present in 
prokaryotes. The global regulatory role of SUMOylation 
in gene expression and protein interaction has been 
richly explored in lower eukaryotes such as yeast [7,8]. 
However, there is only a single SUMO protein in yeast, 
whereas there are three major protein conjugating iso- 
forms present in mammals; these are SUMO-1, and the 
highly similar SUMO-2 and SUMO-3, which are often 
refer to as SUMO-2/3. Recent reports have pinpoin- 
ted some important differences between SUMO-1 and 
SUMO-2/3. These are, firstly, that SUMO-1 is conju- 
gated to its substrates as a mono-SUMOylation, whereas 
SUMO-2/3 are able to form poly-SUMOylation chains 
[9]. Moreover, SUMO-1 acts like a chain terminator to 
the SUMO-2/3 polymers [10]. Secondly, inside cells, 
SUMO-1 appears mostly conjugated to proteins, 
whereas SUMO-2/3 are primarily found in the free form 
and are increased in conjugation to substrates when 
there are cellular stresses [11,12]. Thirdly, the kinetics of 
SUMO-1 de-conjugation is slower than that of SUMO- 
2/3 [13]. Fourthly, a preferential association of SUMO-1 
with the nuclear envelope and nucleolus, whereas 
SUMO-2/3 are distributed throughout the nucleoplasm 
[12]. Fifthly, although many substrates can be modified 
by both SUMO-1 and SUMO-2/3, some substrates are 
preferentially modified by one SUMO isoform or the 
other. The underlying complexity of SUMOylation has 
been extended by the identification of non-covalent 
interaction with effectors via SUMO interaction motifs 
(SIMs) [14]. SIMs are critical to both SUMO conjuga- 
tion and SUMO-mediated effects. Structure analysis 
shows the potential differential specificity of SIMs to- 
ward SUMO paralogues [15]. The specificity of the SIM 
in relation to the SUMO E3 ligase [16-18] and substrate 
[19] has been found to control SUMO paralogue- 
specific modification. Consequentially, this provides an 
additional interaction platform for the selective recruit- 
ment of SUMO-1 or SUMO-2/3 specific SIM-containing 
effector proteins. While numerous studies have provided 
considerable insight into the differences in specificity be- 
tween SUMO paralogues, their scope has been usually 
limited to a single host factor in each case. Discerning 
the genome-wide chromatin modification by SUMO 



paralogue during herpesvirus reactivation will greatly ad- 
vance our knowledge of their differential role in epigen- 
etic regulation and pathogenesis. 

Due to the functional flexibility and far-reaching down- 
stream consequences of SUMO, viruses have evolved dif- 
ferent strategies that are able to manipulate the SUMO 
pathway and improve their survival [20-25]. This makes 
SUMO a potential target for antiviral therapy. Most 
current knowledge related to SUMO modification and vi- 
ruses has been obtained from studying DNA tumor vi- 
ruses, especially members of the herpesviridae and have 
been inevitably linked to counteracting the host's antiviral 
properties. SUMOylation has been found to affect most of 
the immediate-early and early proteins of herpesviruses, 
which are usually transcriptional factors. BZLFl and Rta 
of Epstein-Barr virus (EBV) [26-29], and the K-bZIP of 
KSHV are three such examples [25] . Viruses are also able 
to directly target the key enzymes of the SUMOylation 
pathway, namely the SUMO El activating enzyme, Aosl/ 
Uba2, the SUMO E2 enzyme, Ubc9, the SUMO E3 ligases, 
and the SUMO protease SENP/Ulp; this allows the virus to 
take charge of the SUMOylation modulating factors in the 
cell [30]. Recently, we identified the first viral SUMO E3 
ligase, KSHV K-bZIP; this enzyme has specificity toward 
SUMO-2/3 [16]. The encoding of a SUMO-2/3 specific 
viral SUMO E3 ligase by KSHV suggests that, potentially, 
KSHV is able to exploit the SUMO pathway to globally 
regulate viral and host transcriptional programs. This, in 
turn, implies that SUMO-2/3 may function in a manner 
that is distinct from SUMO-1 during viral reactivation. 

KSHV, also known as human herpesvirus type 8, is a 
y-herpesvirus associated with Kaposi's sarcoma (KS), pri- 
mary effusion lymphomas (PEL) and multicentric Castle- 
men's disease [31]. It is one of the seven recognized 
human cancer viruses [32]. Like all herpesvirus, KSHV 
has distinct latent and lytic phases. Establishment of la- 
tency is a common property of herpesvirus in infected 
cells and is able to prevent their elimination by the host 
immune response, to maintain life-long infection, and to 
induce tumorigenesis [33,34]. In order to establish infec- 
tion and maintain latency, KSHV has acquired a series 
of different strategies that are able to limit innate anti- 
viral responses and evade host immune surveillance, thus 
allowing the persistence of infection. For example, KSHV 
dedicates a large portion of its genome to encoding cellu- 
lar homologues of host immune modulators and is able to 
express unique viral proteins that have immunomodula- 
tory roles [35,36]. For instance, KSHV-replication and 
transcriptional activator (K-Rta), an immediate early (IE) 
protein of KSHV, which is able to activate a wide spectrum 
of KSHV lytic genes and thereby alone can induce viral re- 
activation, has been found to block the interferon (IFN) 
pathway by targeting interferon regulatory factor (IRF) for 
degradation. The KSHV-encoded basic leucine zipper 
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protein (K-bZIP), one of the earliest viral protein 
expressed right after K-Rta during acute infection and viral 
reactivation, has also been found to inhibit the IFN path- 
way by direct impeding IRF binding to the IFN promoter 
[37,38]. The IFN pathway has also been found to be re- 
pressed by K-bZIP in a SUMOylation-dependent manner 
[39]. Moreover, recent studies have shown that SUMOyla- 
tion of the IRFs occurs during viral infection and these 
changes are essential to allowing the virus to negatively 
regulate the IFN pathway [24,40,41]. Another strategy 
employed by herpesviruses such as HSV-1, the prototyp- 
ical member of the Herpesviridae, is the complete sup- 
pression of cellular gene expression, a process termed host 
shutoff. This phenotype is found during lytic herpesviral 
infection and is believed to play an important role in es- 
tablish herpesviral latency [42]. In HSV-1, the global shut- 
off of host gene expression occurs via two major and 
distinct inhibitory pathways. One is a global increase in 
the rate of mRNA degradation and the other is a virus- 
induced suppression of host mRNA synthesis [43]. For 
KSHV, mRNA degradation is performed by the host shut- 
off factor SOX [44]. The linking of SUMOylation to tran- 
scription repression and the finding that K-bZIP is a 
SUMO-2/3 specific E3 ligase led us to examine the possi- 
bility that there may be global silencing of host genes by 
K-bZIP. We have reported previously that K-bZIP, when 
overexpressed, was indeed a general gene-silencer [45]. 

To gain a better understanding of the differential func- 
tionality of SUMO-1 and SUMO-2/3 conjugation on 
chromatin in transcriptional regulation of host genes 
during KSHV reactivation, we performed a genome-wide 
mapping of chromatin modification by SUMO paralo- 
gues using ChlP-seq, a technology that allows the direct 
identification of all SUMO binding sites on the genome. 
Here, we demonstrate that the chromatin-binding pat- 
terns for SUMO-1 and SUMO-2/3 are very similar in 
the non-reactivated control cells. Interestingly, during 
viral reactivation, distinct dynamic chromatin-binding of 
SUMO paralogues was observed. We have demonstrated 
that the chromatin occupancy of SUMO-2/3 but not of 
SUMO-1 is significantly increased during viral reactiva- 
tion and this enrichment is not randomly distributed. 
Enrichment occurs in promoter regions where transcrip- 
tion factors binds. Potential SUMO-2/3 target TFs on 
the chromatin were identified by annotating SUMO 
peaks in relation to putative transcription factor binding 
sites (TFBS) using the Transfac Matrix Database. Here, 
we provide the first comprehensive profile that compares 
the SUMO-1 and SUMO-2/3 landscapes in the human 
genome and predicts the relevant potential modifying 
TFs that bind to the chromatin. Previous findings from 
yeast study have shown that SUMO is globally associ- 
ated with transcriptionally active genes [7,46] and facili- 
tates the shutting off of induced gene transcription [7]. 



This suggests that SUMO modification may also play a 
global role in transcription regulation in mammals. 
Large scale comparative analysis of ChlP-seq and tran- 
scriptome studies using RNA-seq in this study indicates 
that both SUMO-1 and SUMO-2/3 label the promoters 
of highly active genes in the non-reactivated control 
cells. However, during KSHV reactivation, the SUMO-2/3 
modifications are greatly enriched in the promoters of 
highly active genes that show little change in gene ex- 
pression.Together with previous findings from other stu- 
dies, our results indicate that SUMO-1 and SUMO-2/3 
may play similar roles in maintaining the expression of 
highly transcriptional active genes in non-reactivated 
cells. However, the enrichment of SUMO-2/3 at tran- 
scriptionally active genes that show no change in ex- 
pressional level during viral reactivation suggests that 
SUMO-2/3, but not SUMO-1, ensures the steady-state 
expression of host genes without overt activation du- 
ring viral reactivation. Consistent with studies explo- 
ring the functional analysis of SUMO paralogues in 
specific protein molecules such as Daxx [47], in the 
present study we demonstrate that there are distinct dif- 
ferences in the global roles of SUMO-1 and SUMO-2/3 
in cells that are under stress, such as when there is 
herpesvirus reactivation. 

Methods 

Cell culture 

The TREx-Flag(x3)-His(x3)-K-Rta BCBL-1 cell line 
(TREx-F3H3-K-Rta BCBL-1) and TREx-Flag(x3)-His 
(x3)-K-bZIP BCBL-1 cell line (TREx-F3H3-K-bZIP 
BCBL-1) were grown in RPMI 1640 containing 15% 
FBS, 50 |ig/ml blasticidin and 100 pg/ml hygromycin 
(Invitrogen, Carlsbad, CA). To induce KSHV reactiva- 
tion, 0.2 ng/ml doxycycline (dox) was added to the 
TREx-F3H3-K-Rta BCBL-1 cells. K-Rta expression was 
confirmed by Western blot analysis. 

The shRNA cassette of SUMO-2 (5 -CACCGAGGCA 
GATCAGATTCCGAT TCGAAAATCGGAATCTGATC 
TGCCTC-3 ) and SUMO-3 (5 -CACCGGATGAA TC 
TGTAACTTAACGAATTAAGTTACAGATTCATCC-3 ' ) 
was inserted into pLenti4-Hl/TO shRNA plasmid and in- 
troduced into TREx-F3H3-K-Rta BCBL-1 cells by lenti- 
viral transduction. Cells were selected for 14 days by 300 
l^g/ml zeocine (InvivoGen, ant-zn-1) and purified by Ficoll. 
Knockdown efficiency of SUMO-2/3 by shRNA were 
tested by treated the cells with doxycycline (Dox) for 24 
and 48 hours. TREx-F3H3-K-Rta-shSUMO-2/3 BCBL-1 
cells were maintained as described for TREx-F3H3-K-Rta 
BCBL-1 and supplemented with 300 |ig/ml zeocine. 

Plasmid construction 

The full-length human IRF-1, IRF-2 and IRF-7 cDNAs 
were obtained by reverse-transcription polymerase chain 
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reaction (RT-PCR) using an RNA sample isolated from 
BCBL-1 and using an Oligo(dT) primer. The primers 5 - 
AAACGGTCCGATGCCCATCACTCGGATG-3' and 5'- 
AAACGGACCGCTACG GTGCACAGGGAAT-3 ' were 
used for RT-PCR and cloning of IRF-1, the primers 5 - 
AAACGGTCCGATGCCGGTGGAAAGGATG-3' and 5 - 
AAACGGACCGTTAA CAGCTCTTGACGCG-3 ' were 
used for RT-PCR and cloning IRF-2, and the primers 5 - 
AAACGGTCCGATGGCCTTGGCTCCTGAG-3' and 5 - 
AAACGGACCGCTAG GCGGGCTGCTCCAG-3' were 
used for RT-PCR and cloning IRF-7. The IRF DNA frag- 
ment were then cloned into pcDNA3 and confirmed by 
DNA sequencing. 

Chromatin immunoprecipitation-sequencing (ChlP-Seq), 
ChlP-reChIP assay and real-time quantitative PCR (qPCR) 

After 12 hours of K-Rta induction to allow KSHV reacti- 
vation in the TREx-F3H3-K-Rta BCBL-1 cell line, 1 x 10^ 
cells were harvested. ChIP assays were performed follow- 
ing the protocol described by the Farnham laboratory 
(provided at http://genomics.ucdavis.edu/farnham). ChlP- 
reChlP assays were performed by Re-ChlP-IT kit (Active 
Motif, Carlsbad, CA) following the manufacturer's in- 
struction. Chip grade rabbit polyclone antibodies spe- 
cific against SUMO-1 [Y299] (Abeam, ab32058) and 
against SUMO-2/3 (Abeam, ab3742), as well as rabbit 
non-immune serum IgG (Alpha Diagnostic Inter- 
national, San Antonio, TX), were used for the ChIP and 
ChlP-reChIP assays. Rabbit monoclone antibodies spe- 
cific against IRF-7 (GeneTex, Irvine, CA) was used for 
the ChlP-reChIP assays. 

For ChlP-Seq assay, ChlPed DNA was prepared from 
5 X 10 cells that had been resuspended in 30 ul of ddH20 
and ChlP-seq library construction was then carried by fol- 
lowing the sample preparation protocol from lUumina. 
Short reads (100 bp) from both ends (paired-end sequen- 
cing) of size-selected (400 bp) DNA fragments were se- 
lected and subjected to high throughput sequencing on an 
lUumina" Genome Analyzer// System. The ChlP-Seq data 
was aligned onto the human genome hgl9 build using 
UCSC. Around 6 x 10^ reads were mapped for each sample 
after filtering and quality control (QC) were carried out. In 
this study we used the enriched region detection method of 
Avadis NGS (Strand Scientific Intelligence, San Francisco, 
CA) to localize potential protein binding sites in order to 
delineated the SUMO-1 and SUMO-2/3 binding patterns. 

The binding sites were verified by SYBR° Green Based 
qPCR using a CFX connect'" real-time PCR detection sys- 
tem (Bio-Rad, Richmond, CA). Specific primer sets were de- 
signed around the identified binding sites for this purpose. 

RNA-seq and RT-qPCR analysis 

Total RNA was harvested using TRIzol reagent (Invitrogen, 
Carlsbad, CA) from TREx-F3H3-K-Rta BCBL-1 at 12 and 



24 hours after K-Rta induced viral reactivation according 
to the manufacturer's instructions. RNA-seq was con- 
ducted at the Sequencing Core of National Research Pro- 
gram for Genomic Medicine at National Yang-Ming 
University VYM Genome Research Center using an lUu- 
mina Genome Analyzer//. Sequencing reads were first 
trimmed with human ribosomal RNA sequences (28S, 
18S, 5S, human ribosomal DNA complete repeating unit 
and mitochondrial ribosomal RNA) by Bow-tie (version 
1.0.0) with default parameters and then aligned the high 
quality reads to human reference genome hgl9 using 
TopHat (version 2.0.8b) with Bowtie version 2.1.0 and 
sam tools (version 0.1.9) with transcriptome information 
obtained from Ensembl Release 70 and NonCode v3.0. 
The transcript abundances were estimated in fragments 
per kOobase of transcript per million mapped reads 
(FPKM) by Cufflinks version 2.1.1. Genes from all three 
samples with FPKM > 0.05 were considered to be 
expressed and were used for the remaining analysis. Dif- 
ferential gene expression of the samples (K-Rta induction 
for 12 and 24 hours vs. control) was analyzed by compar- 
ing FPKM. For RT-PCR, 2 \ig of total RNA was reverse- 
transcribed using Superscript™ III First-strand synthesis 
system (Invitrogen) and Oligo-dT. qPCR was carried out 
based on the manufacturer's protocol (iQ SYBR Green 
Supermix, Bio-Rad). 

Immunoprecipitation and western blot analysis 

Transfected 293 T cells were collected in modified radio- 
immune precipitation assay (RIPA) buffer (50 mM Tris- 
HCl [pH 6.7], 1% NP-40, 0.25% sodium deoxycholate, 150 
mM NaCl, 1 mM EDTA) supplemented with IX protease 
inhibitor cocktail (Roche, Penzberg, Germany). TCLs were 
incubated with anti-FLAG M2-agarose (Sigma- Aldrich, St. 
Louis, MO) overnight at 4°C. Immune complexes were 
captured by protein A and protein G Sepharose beads. 
Beads were washed, and the bound proteins were analyzed 
by immunoblotting. Antibodies used for immunoblotting 
were anti-IRF-1 (Cell Signaling Technologies, Beverly, 
MA), anti-IRF-2 (Cell Signaling Technologies), anti-IRF-7 
(GeneTex), anti-SUMO-2/3 antibodies. 

Results 

Global identification of the chromatin binding patterns 
of SUMO paralogues reveals that KSHV reactivation Is 
associated with specific enrichment of SUIV\0-2/3 

SUMO modifications of transcription regulatory proteins 
and chromatin modifying enzymes are linked to the epi- 
genetic regulation of gene transcription. SUMO-1 and 
SUMO-2/3 have both common and distinct substrates, 
but their global functional roles in epigenetic regulation 
have not as yet been fully investigated. As mentioned 
earlier, DNA viruses have evolved different strategies 
that allow them to manipulate the SUMO pathway in a 
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manner that helps their survival. We previously identi- 
fied a KSHV lytic protein, K-bZIP, as a viral SUMO E3 
ligase with specificity toward SUMO-2/3 [16]. Using 
KSHV as a model in conjunction with ChlP-seq to inter- 
rogate the binding sites of the various SUMO paralogues 
during viral reactivation, we hoped to distinguish the 
epigenetic regulatory role of the SUMO isoforms during 
viral replication. KSHV is a particularly attractive model 
as its reactivation can be switched on by the expression of 
a single K-Rta gene, and a well characterized Dox- 
inducible TREx-F3H3-K-Rta BCBL-1 cell line is available 
for this purpose. This study has the ability to pinpoint the 
global functional differences in terms of epigenetic regula- 
tion of the SUMO isoforms during viral pathogenesis. 

To study the epigenetic regulation of SUMO paralogues 
in association with KSHV reactivation, the genome-wide 
in vivo binding sites of SUMO-1 and SUMO-2/3 were an- 
alyzed using massively parallel chromatin immunoprecipi- 
tation in combination with high throughput sequencing 
(ChlP-Seq); these processes were carried out on a K-Rta- 
inducible KSHV infected primary effusion lymphoma 
(PEL) cell line, TREx-F3H3-K-Rta BCBL-1. Chromatin 
samples from TREx-F3H3-K-Rta BCBL-1 cells before and 
after K-Rta induction to allow KSHV reactivation were 
isolated and subjected to the ChIP assay using ChIP grade 
SUMO-1 and SUMO-2/3 antibodies. High-throughput se- 
quencing was then performed to measure the binding of 
SUMO-1 and SUMO-2/3 from a single run of ChIP assay. 
Approximately the same number (6 x 10^) of reads from 
the KSHV un-induced and induced samples were mapped 
to the human reference genome, hgl9. Using an enrich- 
ment peak calling algorithm, we found a total of 31315 
and 45846 high confidence SUMO-1 and SUMO-2/3 



enrichment regions, respectively, in the non-reactivated 
control cells. After K-Rta induction for 12 hours, a total of 
39626 SUMO-1 enrichment regions (an increase in 8 K 
peaks of ~ 1.3-fold compared to the control cells) and 86479 
high confidence SUMO-2/3 enrichment regions (an increase 
in 40 k peal<s of ~ 1.9-fold compared to the control cells) 
were identified (Figure lA). Consistent with our previous 
fmdings showing that KSHV encodes a SUMO-2/3 specific 
E3 ligase in its lytic phase [16], there was a significant in- 
crease in SUMO-2/3 modification across human genome, 
whereas SUMO-1 modification showed a relatively similar 
occupancy abet with a slight increase; these findings, suggest 
that KSHV specifically exploits SUMO-2/3 in order to regu- 
late viral and host transcriptional programs. 

Common binding sites that were shared by the SUMO 
paralogues were assessed by examining the overlap in their 
binding profiles between the K-Rta induced and non- 
induced states. The results showed that around 30% of the 
SUMO-1 and SUMO-2/3 binding sites under both condi- 
tions showed colocalization (Figure IB). Interestingly, the 
number of SUMO-2/3 specific binding sites was increased 
by 12% (fi-om -27 k to -57 k) whOe the number of SUMO- 
1 specific binding sites was decreased by 11% (from -13 k 
to -10 k) during viral reactivation (Figure IB). These find- 
ings suggest that KSHV reactivation is accompanied by sig- 
nificant changes in the magnitude of SUMO-2/3 tagging 
across the genome. By contrast, the level of SUMO-1 tag- 
ging on chromatin remains relatively unchanged. 

SUMO-2/3 is enriched on the promoter regions after 
KSHV reactivation 

SUMO is capable of binding to chromatin by modify- 
ing chromatin remodeling proteins and transcription 
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Figure 1 Overview of ChlP-seq data showing chromatin occupancy of SUMO paralogues during KSHV reactivation. (A) Histograms of 
ChlP-seq profiles across chromosome 1 and 8 for SUMO-1 and SUMO-2/3 binding sites before and after KSHV reactivation. (B) Overlap of 
SUMO-1 and SUMO-2/3 binding sites before and after KSHV reactivation in BCBL-1 cells; numbers indicate counts for overlapping and 
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factors [48] . In this context, SUMO modifications are likely 
to have specific distributions across the genome. As ex- 
pected, SUMO target sites occur across all chromosomes 
but are not randomly distributed. They are enriched in 
regions containing genes, notably in regions annotated as 
promoters. As Figure 2A reveals, chromatin-bound SUMO 
paralogues are commonly centered and symmetrically dis- 
tributed within 500 bp around transcription start sites 
(TSSs). This pattern is similar to that reported for chro- 
matin modification by SUMO-1 through the cell cycle 
[46]. Interestingly, after overlaying the SUMO-binding 
data after viral reactivation onto the control SUMO- 
binding data, we discovered that there was a significant in- 
crease in SUMO-2/3 occupancy near to TSSs (Figure 2C) 
after KSHV reactivation, whereas SUMO-1 occupancy 
showed a slight decrease (Figure 2B). A similar pattern 
was identified for the groups of peaks that contain only 
SUMO-1 or SUMO-2/3 specific modification. 

Consistent results were obtained when SUMO peaks 
were normalized for the size of defined genome compart- 
ment. SUMO paralogues showed a relative higher peak 
density in promoter regions (TSSs ± 500 bp), whereas the 



binding to the gene bodies themselves (transcribed re- 
gions), the transcription end sites (TESs), the regions up- 
stream of the gene, the regions downstream of the gene 
and the intergenic regions were low (Figure 2D and 2E). 
Consistently, the peak density of SUMO-2/3 (Figure 2E), 
but not of SUMO-1 (Figure 2D), was significantly in- 
creased in the promoter regions during viral reactivation. 
These findings indicate that, while the chromatin-bound 
SUMO paralogues are both centered on the TSSs, only 
chromatin-bound SUMO-2/3 is significantiy increased 
during KSHV reactivation. 

Global prediction of potential SUMO-I and SUI\/10-2/3 
targeting of chromatin-bound transcription factors 

Typical SUMO binding sites are focal and consist of no 
more than a few hundred base pairs, a pattern reminis- 
cent of the "peaks" associated with transcription factors. 
A large number of known SUMO conjugates in mam- 
mals are transcription factors. To predict the potential 
chromatin-associated transcription factors (TFs) that are 
SUMOylated, we annotate SUMO target sites within 
promoter regions in relation to transcription factor 




Figure 2 Genome-wide analysis of SUMO-1 and SUI\/IO-2/3 binding region during KSHV reactivation. (A-C) Distance distribution of all 
SUMO-1 and SUMO-2/3 peaks. Distance to transcription start site (TSS) before (A) and after (B and C) K-Rta induction for viral reactivation. 
(D and E) Gene context of SUMO-1 (D) and SUMO-2/3 (E) binding sites during KSHV reactivation represented by peak density, after adjustment 
for the prevalence of gene context category in the genome. Promoter: TSS ± 500; Promoter pro. (Promoter proximal): TSS ± 2 Kb; 3' end: TES ± 
500; Upstream: -2 kb to -10 kb upstream of the TSS; Downstream: -t2 kb to -tlO kb downstream of the TES; Intergenic: >10 kb from any coding 
genes. TES: transcription end site. 
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binding sites (TFBS) from the Transfac Matrix Database 
(v7.0) created by Biobase. This database contains 258 
TFBS weight matrices that represent the potential DNA 
binding sites of 176 TPs across the genome. The SUMO 
enriched peaks for each TFBS in the promoter region 
were normalized with their own distribution frequency. 
Potential SUMO target TFs were ranked by percentage 
and Hampel Identifier was used to identify the TFBSs 
that were significantly correlated with SUMO binding 
[49]. The number of SUMO peaks identified before viral 
reactivation was used as the control. High confidence 
TFBSs that correlated with SUMO-1 and SUMO-2/3 
peaks were mapped and are represented here as "poten- 
tial SUMO-1 and SUMO-2/3 target TFs". Interestingly, 
during viral reactivation, the SUMO-1 target TFs de- 
creased from 18 to 10, while the SUMO-2/3 target TFs 
were significantly increased from 22 to 86 (Figure 3A). 
When we overlapped the potential SUMO-1 and 
SUMO-2/3 target TFs, we found that 74% of the TFs 
shared by SUMO paralogues in the non-reactivated con- 
trol cells and this decreased (10%) during viral reactiva- 
tion (Figure 3A). Around 20 ~ 30% of the potential 
SUMO-1 and SUMO-2/3 target TFs overlapped before 
and after KSHV reactivation (Figure 3B). Moreover, 
SUMO-1 target TFs consisted of more non-overlapping 
TFs before viral reactivation (11 of 21), while, on the 
other hand, there were more non-overlapping TFs for 
SUMO-2/3 (65 of 87) that were recognized after viral re- 
activation (Figure 3B). Collectively, these results suggest 
that SUMO-2/3 significantly increased its tagging of TFs 
bound to promoter regions during KSHV reactivation. 

The top twenty potential SUMO-1 and SUMO-2/3 
target TFs before and after KSHV reactivation are listed 
in Tables 1, 2, 3 and 4. If less than twenty TFs have been 
identified, all of them are listed. One interesting point 
involves the reasons for SUMO-2/3 target TFs iden- 



tification after viral reactivation, which is quite different 
from that for SUMO-1 target TFs. As shown in Table 4, 
there is a more significant increase in peak numbers for 
the top-20 SUMO-2/3 target TFs after KSHV reactivation 
comparing with the top twenty TFs listed from the non- 
reactivated control cells (Table 2). In contrast, the peak 
number for the SUMO-1 target TFs after viral reactivation 
almost all decreased (Table 3); this decrease is less than 
that of the control (Table 1). The findings indicate an in- 
crease in SUMO-2/3 modification of TFs during viral re- 
activation. In contrast, SUMO-1 modification of TFs after 
viral reactivation had decreased. These finding suggest 
that SUMO paralogues are differentially regulated in a glo- 
bal manner under certain circumstances, for example 
when there is viral reactivation as is the case here. The 
SUMO-1 and SUMO-2/3 specific TFBSs identified here 
provides a framework that allows the study of the potential 
functional differences between SUMO paralogues. 

Identification of potential transcription factors targeted 
by SUMO-2/3 during KSHV reactivation 

The presence of highly enriched SUMO-2/3 binding 
sites around the promoter regions during viral reactiva- 
tion suggest that SUMO-2/3 might be directly or indir- 
ectly targeting a large group of transcription factors 
during KSHV reactivation. In order to pinpoint the most 
important gene-regulating TFs that are targeted by 
SUMO-2/3 during viral reactivation, we collected genes 
with SUMO-2/3 targeted TFBSs at their promoter be- 
fore and after viral reactivation and group them into an 
up-group (SUMO peaks increase >1.5X), a down-group 
(SUMO peaks decrease >1.5X) and a no-change-group 
(SUMO peaks variants within 1.5X). When we ranked 
the TFs by total gene number, the top 10 most import- 
ant gene-regulating TFs targeted by SUMO-2/3 after 
KSHV reactivation could be identified (Figures 4 and 5). 



A 



^— 50% 



B 



□ SUMO-1 
13SUMO-2 3 
Hoveriap 



50% 

H 



o * 



□ Oh 
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c 



o 



o 



Oh 



14 



□ SUMO-1 

□ su^o-2': 

□ overlap 



12h 



Oh I in SUMO-1 SUMO-2/3 

Time ChlP enrichment Time 

Figure 3 Overview of potential TFs targeting by SUMO paralogues during KSHV reactivation. (A) Percentage of overlapped TFs target 
sites for SUMO-1 and for SUMO-2/3 before and after K-Rta induction resulting in KSHV reactivation (B) Percentage of overlapped SUMO-1 or 
SUMO-2/3 target TFs before and after viral reactivation. (C) Percentage overlap of the top-20 potential TF targets for SUMO-1 and SUMO-2/3 
before and after viral reactivation; numbers indicate overlapping and non-overlapping TF counts. 
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Table 1 Top 20 potential SUMO-1 targeting TFs before KSHV reactivation 

Rank Name Transfac TFBS # Peak # of TFBS % of TFBS Peak Gene # 

(in TSS±500 bp) (in TSS±500 bp) (in TSS±500 bp) 











0 h 


12 h 


Oh 


12 h 


0 h 


12 h 


1 


ETS1 


V$CETS1PS4_01 


1587 


341 


262 


21.5% 


1 6.5% 


294 


229 


2 


GABPA/NRF-2 


V$NRF2_01 


1577 


336 


237 


21.3% 


1 5.0% 


286 


206 


3 


ELK1 


V$ELK1_02 


1667 


333 


254 


20.0% 


1 5.2% 


279 


215 


4 


USF1 


V$USF_C 


441 


79 


45 


1 7.9% 


10.2% 


71 


44 






V$USF_Q6 


1136 


146 


97 


1 2.9% 


8.5% 


132 


92 


5 


STAT1 


V$STAT1_03 


282 


50 


41 


1 7.7% 


14.5% 


48 


39 






V$STAT1_01 


1179 


146 


122 


1 2.4% 


1 0.3% 


134 


110 


5 


STAT4 


V$STAT4_01 


106 


17 


17 


16.0% 


16.0% 


4 


4 


7 


MYC/MAX 


V$MYCMAX_02 


580 


90 


76 


1 5.5% 


13.1% 


68 


55 






V$MYCMAX_01 


784 


104 


88 


1 3.3% 


1 1 .2% 


93 


68 


8 


CREB1 


V$CREB_01 


666 


103 


87 


1 5.5% 


13.1% 


97 


83 


9 


MAX 


V$MAX_01 


584 


87 


68 


14.9% 


1 1 .6% 


75 


64 


10 


ATF-2 


V$CREBP1_01 


272 


37 


30 


1 3.6% 


1 1 .0% 


35 


28 


11 


STAT3 


V$STAT3_01 


986 


133 


108 


1 3.5% 


1 1 .0% 


123 


101 


12 


SREBF1/SREBP1 


V$SREBP1_01 


1038 


140 


108 


1 3.5% 


10.4% 


110 


93 


13 


MYCN 


V$NMYC_01 


1467 


195 


125 


1 3.3% 


8.5% 


170 


114 


14 


STAT6 


V$STAT_01 


570 


75 


59 


1 3.2% 


1 0.4% 


66 


54 


15 


ARNT 


V$ARNT_01 


835 


108 


89 


1 2.9% 


10.7% 


102 


87 


16 


ARID5B/MRF2 


V$MRF2_01 


478 


61 


57 


12.8% 


1 1 .9% 


56 


52 


17 


E2F1/2/3/4/5 


V$E2F_02 


1301 


166 


121 


12.8% 


9.3% 


149 


110 


18 


PBX1 


V$PBX1_01 


220 


28 


21 


12.7% 


9.5% 


28 


21 



bold: transcription factor with evidence of SUIVIO modification. 



Interestingly, we found that there were three IRFs, 
IRF-7, IRF-1, and IRF-2 that do not exist in the SUMO- 
2/3 target TFs list before viral reactivation, but are now 
listed as the 4th, 5th and 6th top-most TFs, respectively, 
after viral reactivation. Comparing the SUMO tagging of 
these IRFs before and after viral reactivation, we found 
that all three IRFs binding sites are preferentially sub- 
jected to SUMO-2/3 modification after viral reactivation 
(Figure 6). To confirm that SUMO-2/3 enrichment at 
the IRF-1, IRF-2 and IRF-7 binding sites occurs during 
viral reactivation, we design primers targeting the IRFs 
binding regions where the SUMO-2/3 peaks had been 
identified by the ChlP-seq assay. The SUMO-2/3 enrich- 
ment in those regions was then validated using a ChIP 
sample and real-time quantitative-PCR (qPCR). Consist- 
ent with the ChlP-seq results, the 12 IRF binding re- 
gions tested here showed significant enrichment after 
viral reactivation for SUMO-2/3 but not for SUMO-1 
compare to the non-reactivated control cells (Figure 7). 
ChlP-reChIP analyses further confirmed the co- 
localization of IRF-7 and SUMO-2/3 on IRF-7 binding 
region with SUMO-2/3 enrichment (Figure 8). 



The identification of IRF-1, IRF-2 and IRF-7 as poten- 
tial SUMO-2/3 targets during KSHV reactivation sug- 
gests that the viral SUMO E3 ligase K-bZIP may be 
involved in this phenomenon. To address this, we first 
cloned all three IRFs using cDNA of BCBL-1 cells. Then 
293 T cells were transiently co-transfected with Flag- 
tagged IRF-1, IRF-2 or IRF-7, T7-tagged SUMO-2 and 
SUMO-3, and HA-tagged K-bZIP; this was followed by 
immunoblotting or immunoprecipitation using Flag 
antibody. The results showed an increase in SUMO 
modification of IRF-1 and IRF-2 when there was overex- 
pression of K-bZIP (Figure right panel of 9A and B). 
SUMOylation of IRF-1 and IRF-2 was further confirmed 
by immunoblotting using anti-SUMO-2/3 antibody 
(Figure left panel of 9A and B). Although we were un- 
able to identify SUMOylation of IRF-7 using this ap- 
proach (data not shown), an immunoprecipitation assay 
showed that IRF-7 is able to interact with K-bZIP 
(Figure 9C and D). Thus SUMO-mediated transcription 
regulation not only involves covalent SUMO modifica- 
tion of transcription regulatory proteins, but also seems 
to involve SUMO modified co-regulatory proteins that 
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Table 2 Top 20 potential SUMO-2/3 targeting TFs before KSHV reactivation 



Rank 


Name 


Transfac 


TFBS # 


Peak # of TFBS 


% of TFBS Peak 


Gene # 








(in TSS±500 bp) 


(in TSS±500 bp) 


(in TSS±500 bp) 














0 h 


12 il 


Oil 


12 h 


0 h 


12 h 


1 


ETS1 


V$CETS1PS4_01 


1587 


337 


397 


212% 


25.0% 


296 


353 


2 


GABPA/NRF-2 


V$NRF2_01 


1577 


320 


379 


20.3% 


24.0% 


274 


335 


3 


ELK1 


V$ELK1_02 


1667 


336 


382 


202% 


22.9% 


285 


328 


4 


STAT4 


V$STAT4_01 


106 


20 


12 


1 8.9% 


1 1 .3% 


7 


12 


5 


STAT1 


V$STAT1_03 


282 


53 


79 


1 8.8% 


28.0% 


45 


68 


6 


CREB1 


V$CREB_01 


666 


124 


164 


1 8.6% 


24.6% 


114 


152 


7 


ATF2 


V$CREBP1_01 


272 


45 


67 


1 6.5% 


24.6% 


45 


64 






V$CREBP1_Q2 


1273 


168 


201 


1 3.2% 


1 5.8% 


135 


198 


8 


STAT6 


V$STAT_01 


570 


92 


122 


16.1% 


21.4% 


85 


110 


9 


MYC/MAX 


V$MYCMAX_02 


580 


93 


96 


16.0% 


16.6% 


74 


87 


10 


MAX 


V$MAX_01 


584 


93 


110 


1 5.9% 


18.8% 


79 


97 


11 


HLF 


V$HLF_01 


438 


69 


90 


1 5.8% 


20.5% 


66 


85 


12 


USF1 


V$USF_C 


441 


69 


86 


1 5.6% 


1 9.5% 


64 


78 


13 


ARID5B/MRF2 


V$MRF2_01 


478 


74 


100 


1 5.5% 


20.9% 


68 


91 


14 


NFYA/B/C 


V$NFY_Q6 


1557 


21 7 


295 


1 3.9% 


1 8.9% 


176 


250 






V$NFY_C 


416 


57 


79 


1 3.7% 


1 9.0% 


55 


75 


15 


SREBF1/SREBP1 


V$SREBP1_01 


1038 


144 


179 


1 3.9% 


1 7.2% 


100 


170 


16 


STAT5A 


V$STAT5A_02 


982 


135 


155 


1 3.7% 


1 5.8% 


109 


133 


17 


ZEB1 


V$AREB6_04 


530 


70 


89 


1 3.2% 


16.8% 


60 


78 


18 


ARNT 


V$ARNT_01 


835 


110 


136 


1 3.2% 


16.3% 


106 


131 


19 


E2F1/2/3/4/5 


V$E2F_02 


1301 


171 


226 


13.1% 


1 7.4% 


155 


199 


20 


STAT3 


V$STAT3_01 


986 


129 


168 


13.1% 


1 7.0% 


127 


159 


bold: transcription factor with evidence of SUIVIO modification. 














show a 


non-covalent 


association at the 


TFBS. These 


SUIVlO-2/3 is enriched on promoters of immune-related 


findings 


suggest that 


IRF-7 may recruit 


K-bZIP to its 


genes that are unaltered during KSHV reactivation 




binding 


sites together with other K-bZIP SUMOylated 


To study the functional role of SUMO-2/3 


in the 


regula- 


chromatin binding protein(s); these are then able to be 


tion of gene 


expression during 


KSHV reactivation, we 


co-immunoprecipitated (co-IPed) by SUMO antibody. 


conducted a 


detailed 


RNA-seq 


analysis 


using 


TREx- 


Table 3 Top 20 potential SUMO-1 targeting TFs after KSHV reactivation 












Rank 


Name 


Transfac 


TFBS # 


Peak # of TFBS 


% of TFBS Peak 


Gene # 








(in TSS±500 bp) 


(in TSS±500 bp) 


(in TSS±500 bp) 














0 h 


12 h 


0 h 


12 h 


0 h 


12 h 


1 


ETS1 


V$CETS1P54_01 


1587 


341 


262 


21.5% 


1 6.5% 


294 


229 


2 


STAT4 


V$STAT4_01 


106 


17 


17 


1 6.0% 


1 6.0% 


4 


4 


3 


ELK1 


V$ELK1_02 


1667 


333 


254 


20.0% 


1 5.2% 


279 


215 


4 


GABPA/NRF-2 


V$NRF2_01 


1577 


336 


237 


21.3% 


1 5.0% 


286 


206 


5 


STAT1 


V$STAT1_03 


282 


50 


41 


1 7.7% 


14.5% 


48 


39 


6 


NFATC1/2/3/4 


V$NFAT_Q6 


487 


48 


70 


9.9% 


14.4% 


22 


23 


7 


STAT5A 


V$STATSA_02 


982 


118 


129 


1 2.0% 


13.1% 


95 


104 


8 


MYC/MAX 


V$MYCMAX_02 


580 


90 


76 


1 5.5% 


13.1% 


68 


55 


9 


CREB1 


V$CREB_01 


666 


103 


87 


1 5.5% 


13.1% 


97 


83 


10 


CUTL1 


V$CDPCR1_01 


582 


59 


73 


10.1% 


1 2.5% 


54 


49 



bold: transcription factor with evidence of SUMO nnodification. 
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Table 4 Top 20 potential SUMO-2/3 targeting TFs after KSHV reactivation 



^ank 


Name 


Transfac 


TFBS # 
(in TSS±500 bp) 


Peak # of TFBS 
(in TSS±500 bp) 


% of TFBS Peak 
(in TSS±500 bp) 


Gene # 










0 h 


12 h 


0 h 


12 h 


0 h 


12 h 


1 


STAT1 


V$STAT1_03 


282 


53 


79 


1 8.8% 


28.0% 


45 


68 


2 


ETS1 


V$CETS1P54_01 


1587 


337 


397 


21.2% 


25.0% 


296 


353 


3 


PBX1 


V$PBX1_01 


220 


28 


55 


1 2.7% 


25.0% 


28 


54 


4 


ATF2 


V$CREBP1_01 


272 


45 


67 


1 6.5% 


24.6% 


45 


54 


5 


CREB1 


V$CREB_01 


666 


124 


164 


1 8.6% 


24.6% 


114 


152 


6 


GABPA/NRF-2 


V$NRF2_01 


1577 


320 


379 


20.3% 


24.0% 


274 


335 


7 


ELK1 


V$ELK1_02 


1667 


336 


382 


20.2% 


22.9% 


285 


328 


8 


IRF-1 


V$IRF1_01 


534 


46 


118 


8.6% 


22.1% 


43 


113 


9 


STAT6 


V$STAT_01 


570 


92 


122 


16.1% 


21.4% 


85 


110 


10 


F0X04 


V$FOXO4_02 


456 


28 


97 


6.1% 


21.3% 


28 


75 


11 


NFATC1/2/3/4 


V$NFAT_Q6 


487 


50 


102 


1 0.3% 


20.9% 


29 


55 


12 


ARID5B/MRF2 


V$MRF2_01 


478 


74 


100 


1 5.5% 


20.9% 


68 


91 


13 


IRF-2 


V$IRF2_01 


601 


48 


125 


8.0% 


20.8% 


63 


93 


14 


SRY 


V$SRY_01 


523 


32 


108 


6.1% 


20.7% 


30 


91 


15 


HLF 


V$HLF_01 


438 


69 


90 


1 5.8% 


20.5% 


66 


85 


16 


IRF-7 


V$IRF7_01 


754 


87 


153 


1 1 .5% 


20.3% 


79 


125 


17 


MEF2A 


V$MEF2_01 


455 


36 


91 


7.9% 


20.0% 


28 


72 


18 


GATA1 


V$GATA1_04 


191 


14 


38 


7.3% 


1 9.9% 


13 


35 


19 


POU3F2 


V$POU3F2_02 


246 


17 


48 


6.9% 


1 9.5% 


17 


43 


20 


USFl 


V$USF_C 


441 


69 


86 


1 5.6% 


1 9.5% 


64 


78 



bold: transcription factor with evidence of SUMO nn edification. 



F3H3-K-Rta BCBL-1 cells before and after K-Rta induc- 
tion for viral reactivation. We sorted 26008 genes using 
expression levels based on FPKM into five groups, 
namely no expression (FPKM <0.05: 7954), low expres- 
sion (FPKM 0.05 ~ <1: 6503), medium expression (FPKM 
1 ~ <10: 5597), high expression (FPKM 10 ~ <100: 5390), 
and very high expression (FPKM >100: 564). We found 
that between 27% and 37% of the very high and high ex- 
pression group, about 16% of the medium expression 
group, about 4% of the low expression group, and about 
1% of the no expression group promoters were labeled 
by SUMO-1 or SUMO-2/3 (Figure lOA). Consistent 
with a previous study using yeast [7] and a study of 
SUMO-1 using HeLa cells [46], both of which showed 
that SUMO preferentially occupies transcriptionally 
active genes, the modification by all SUMO paralo- 
gues explored here also seems to be present at greater 
levels on the promoters of genes that show a higher 
level of expression. Twenty-four hours after KSHV re- 
activation, we found a significant increase in SUMO- 
2/3 binding at the promoters of genes with high ex- 
pression (~15%) and medium expression (-10%). This 
compared with little SUMO-2/3 binding enrichment 



at the promoters of genes with low expression (-2%) 
and no expression (<1%) (Figure IOC). In contrast, 
there was a slight decrease in SUMO-1 modification 
across all expression categories (Figure lOB). These 
results indicate that SUMO-1 and SUMO-2/3 modifica- 
tions are important for maintaining the transcription pro- 
files of the non-reactivated control cells. Nevertheless, the 
increase in SUMO-2/3 modification during KSHV reacti- 
vation supports the notion that specific SUMO-2/3 
targeting is important to transcription regulation during 
the KSHV life cycle. 

In addition to maintaining constitutive transcription, 
SUMO has also been found to prevents the overt activa- 
tion of induced genes by facilitating the shut off of the 
transcription in yeast [7]. To assess the effect of SUMO- 
2/3 enrichment on the shut-off of transcription, we com- 
pared global host gene expression in BCBL-1 cells before 
and after KSHV reactivation. We found that among 
the ~ 18,000 transcriptionally active host genes (genes 
with FPKM >0.05), only -2,600 of the up-regulated genes 
and ~2,200 of the down-regulated genes were changed 
more than 1.5-fold in response to KSHV reactivation for 
24 hours. A similar result was found at 12 hours after viral 
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Name 



Transfac 



Logo 



Gene # 

Up No change Down 



ETSl 



ELKl 



NFYA/B/C 



STAT3 



CREBl 



STATS A 



V$CETS1P54_01 



GABPA/NRF-2 V$NRF2_01 



V$ELK1_02 



VSNFY_Q6 



E2F1/2/3/4/5 V$E2F_02 



V$CREBP1_Q2 



V$STAT3_01 



SREBF1/SREBP1 V$SREBP1_01 



V$CREB_01 



V$STAT5A_02 



c3GAAG_- 
- ^ -CCAaT 
^^TGAC x^-^_ 

axTTCCc AA^x- 

tCAcsTG^-_ 



192 



188 



169 



139 



108 



102 



100 



93 



87 



81 



147 



130 



141 



106 



85 



75 



54 



46 



58 



50 



79 



82 



42 



35 



43 



43 



38 



30 



31 



Figure 4 Top 10 potential SUMO-2/3 targeting TFs ranlcing by gene number before KSHV reactivation, bold: transcription factor with 
evidence of SUMO modification. 



reactivation (-1,900 up-regulated and ~2,100 down- 
regulated genes). Analysis of the SUMO peak distributions 
within the promoter region of these transcriptionally up- 
regulated, down-regulated, and no change genes showed 
that the predominant association is between SUMO en- 
richment peaks and genes that can be shown to exhibit no 
change in expression. After viral reactivation for 24 
hours, there was a significant increase in SUMO-2/3 re- 
cruitment to the promoters of transcriptionally un- 
altered genes (Figure llA and B). When we further 
grouped the SUMO peaks into increased binding, de- 
crease binding and no change in binding during viral re- 
activation, a similar result was found (Figure IIC to E). 
SUMO-2/3 peak enrichment during viral reactivation 
was predominantly associated with transcriptionally un- 
altered genes. When we analyzed the association of the 
expression level of the transcriptionally up-regulated, 
down-regulated and unaltered genes with SUMO peaks 
during viral reactivation, we found that most of the viral 
up-regulated (>80%) and down-regulated (>90%) genes 
fall into the low (FPKM 0.05 ~ 1) and no expression 
(FPKM <0.05) gene categories that show litde SUMO-2/3 
modification on their promoter (Figure 12A and B). 



Furthermore and interestingly, more than 75% of the 
expression-unchanged genes, which contain a higher 
proportion of SUMO-2/3 modification on their pro- 
moter, are in the medium and high expression gene 
categories (Figure 12C and Figure 13). This result indi- 
cates that SUMO-2/3 enrichment during viral reactivation 
may contribute to "stabilizing" the transcriptional activity 
of these medium (FPKM 1 ~ 10) and high (FPKM >10) 
expression genes during viral reactivation. To confirm the 
RNA-seq data, we design primers for IRF-1, IRF-2 and 
IRF-7 targeted and transcriptionally active genes that 
show no change in expression during viral reactiva- 
tion in BCBL-1 cells. The lack of change in expres- 
sion during viral reactivation was confirmed using 
cDNA samples and real-time qPCR. Consistent with 
the RNA-seq results, the 12 genes tested here showed 
no changes in expression level compared to the con- 
trol after K-Rta-induced KSHV reactivation (Figure 14). 
To further study the "stabilizing" potential of SUMO-2/3 
in transcription regulation, we generated an inducible 
SUMO-2/3 knockdown BCBL-1 cell line, TREx-F3H3-K- 
Rta-shSUMO-2/3 BCBL-1. Western blot analysis shows 
the successful knockdown of SUMO-2/3 at 24 and 48 
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Figure 5 Top 10 potential SUMO-2/3 targeting TFs ranl(ing by gene number after KSHV reactivation, bold: transcription factor with 
evidence of SUMO modification. 



hours after induction (Figure 14A). Consistent with our 
hypothesis, qPCR analysis showed a higher induction 
of most of the 12 genes we analyzed after SUMO-2/3 
knockdown during viral reactivation (Figure 14B). Again, 
these results imply that SUMO-2/3 enrichment within 
the host promoter region during KSHV reactivation is 
closely related to preventing transcriptional activation 
of constitutively active host genes, many of which are 
immune response genes (see below). 

To determine whether SUMO-2/3 target a group of 
genes with specific functions, we carried out a gene 
ontology (GO) analysis of genes that are targeted by 
SUMO-1 and SUMO-2/3 before and after viral re- 
activation using the IPA software. We found that the 
genes targeted by SUMO-2/3 after viral reactivation are 
significantly involved in several pathways related to cel- 
lular immune responses, cytokine signaling, cell growth. 



apoptosis and cancer (Table 5). We further analyzed 
SUMO targeted genes with no change in transcription 
and genes with no change in transcription but increased 
in SUMO-2/3 enrichment. Consistently, we found that 
the transcriptionally unaltered genes targeted by SUMO- 
2/3 after viral reactivation are significantly involved in 
cellular immune responses (Table 6). Taken together, all 
of the present results support the notion that KSHV 
may target SUMO-2/3 modified proteins to active chro- 
matin regions to prevent overt activation of various im- 
portant genes during viral reactivation, especially those 
involved in the innate immune response. 

Discussion and conclusions 

SUMO is a multifaceted modifier of chromatin structure. 
SUMO modification of chromatin proteins regulates a 
range of cellular processes including transcription, repli- 
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Figure 8 IRF-7 and SUMO-2/3 co-localized at IRF-7 binding sites 
with SUMO-2/3 binding enrichment after KSHV reactivation. 

Quantification of DNA recovered from DAPPl and KIAA1370 
promoters by real-time qPCR after enrichment by ChIP with rabbit 
non-immune serum IgG or anti-SUMO-2/3 antibodies and reChIP of 
IRF-7 with anti-IRF-7 antibody. One non-IRF-7 target gene, KIAA1033, 
was used in qPCR as negative controls. 



cation, DNA repair and chromosome segregation. 
SUMOylation has long been believe to be associated with 
gene silencing or repression. However, global mapping of 
chromatin binding by SUMO in yeast [7] and Drosophila 
[50], show that SUMOylated proteins are present at tran- 
scriptionally active and induced genes. This discovery led 
to the hypothesis that SUMO functions to prevent super- 
induction of actively transcribed genes by external factors 
(in this case, viral infection) to maintain a steady-state 
level of transcription. However, lower eukaryotes possess 
only one SUMO isoform, whereas there are two groups of 
SUMO variants in humans; SUMO-l and SUMO-2/3. Re- 
cently, the global chromatin localization of SUMO-l 
through the cell cycle of human HeLa cells has been iden- 
tified. Similar to that reported in yeast, SUMO-l tends to 
cluster around transcriptionally active genes [46]. Al- 
though increasing evidence from studies targeting specific 
cellular factors suggests that there is differential conjuga- 
tion and functionality among SUMO paralogues, the glo- 
bal functional heterogeneity of human SUMO paralogues 
seems to be limited in their conjugation dynamics [11,12] 
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and subcellular localizations [51]. The global functional 
differences between SUMO paralogues in terms of epigen- 
etic regulation remains a puzzle. In this study, we com- 
pared the chromosome-wide labeling of SUMO-1 and 
SUMO-2/3 proteins before and after herpesvirus reactiva- 
tion using the ChlP-Seq assay. We found that firstiy, on a 
genome-wide scale, the binding profile of the SUMO 



paralogues was highly similar in the control cells, but that 
differences were evident after KSHV reactivation with 
there being a significant increase in SUMO-2/3 binding 
while there was only limited changes in the SUMO-1 
binding profile. Secondly, the distribution of both SUMO 
paralogues on the chromatin showed a greater tendency 
toward being associated with transcription regulatory 
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regions (promoters) and that, furthermore, the binding of 
SUMO-2/3 onto the promoter regions was significantly 
increased during viral reactivation. Thirdly, there was a 
dramatic increase in SUMO-2/3 binding and a slight de- 
crease in SUMO-l binding onto TFBSs during viral reacti- 
vation. Fourthly, the potential SUMO-l and SUMO-2/3 



target TPs highly overlapped in the control cells, while the 
SUMO-2/3 specific TPs are significantly increased during 
viral reactivation. Pifthly, three IRPs, "the master regula- 
tors of immune responses" show up in the top-10 most 
important gene-regulating TPs targeted by SUMO-2/3 
after KSHV reactivation. Sixth, both the SUMO paralogues 
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Figure 13 Histogram of SUMO paralogue binding sites before and after KSHV reactivation. Examples of the epigenetic features associated 

with no expression (A and B), low expression (C and D), medium expression (E and F), high expression (G and H), and very high expression 
(I and J) gene loci. No expression: blue; low expression: green; medium expression: yellow; high expression: orange; very high expression: red. 



are preferentiaEy localized on the promoters of highly 
expressed genes, and that SUMO-2/3 is predominantly 
found associated with highly expressed genes that show 
no change in expression during herpesvirus reactivation. 
Finally, after viral reactivation, SUMO-2/3 is significantly 
associated with the promoters of genes in pathways re- 
lated to cellular immune responses, cytokine signaling, cell 
growth and apoptosis. To our knowledge, our findings are 
the first to compare dynamically the global chromatin- 
binding profiles of SUMO-1 and SUMO-2/3 across the 
human genome and suggest that, while the binding profile 
of SUMO paralogues is similarly under un-induced condi- 
tion, they do change differently during KSHV infection. 

Herpesviruses have evolved multiple mechanisms to 
target SUMOylation pathways, including modulating 
SUMO conjugation enzymes (SUMO El ligase, SUMO 
E2 ligase and SUMO E3 ligase) and deconjugation en- 
zymes (SUMO-specific proteases; SENP) as well as by 
directly targeting SUMOylated proteins [30]. Interest- 
ingly, KSHV encodes a SUMO E3 ligase in the lytic 
phase and this enzyme is likely to be the reason behind 
the increase in SUMO-2/3 paralogues present on chro- 
matin during viral reactivation [16]. However, this hy- 
pothesis needs to be rigorously tested via a knock-in 
recombinant KSHV containing a SIM mutant of K-bZIP 
that results in a loss of its SUMO E3 ligase activity. This 
will be an interesting direction to investigate in the fu- 
ture. Moreover, we cannot exclude the possibility that the 
induction of K-Rta activates host SUMO E3 ligase to de- 
posit SUMO-2/3 at the promoter regions. For example, 
we have previously identified a host factor, KAPl, is phos- 
phorylated by KSHV vPK during KSHV reactivation [23] 



and KAPl has recently been reported to be a SUMO E3 
ligase for IRF-7 [52]. 

The complete sequence of the human genome was ob- 
tained more than a decade ago; nevertheless, our under- 
standing of this genome is far from complete. The 
emerging concept from Encyclopedia of DNA Elements 
(ENCODE) is that biochemical functions of a genome 
can be assigned by systematically identifying the func- 
tional elements within the genome [53]. Patterns in 
chromatin modification or transcription factor binding 
onto the functional elements assists with the prediction 
of their role, particularly when RNA expression is exam- 
ined. The global but uneven distribution of SUMO 
modification near TSSs prompted us to study the distri- 
bution of SUMO modification on different functional el- 
ements of the genome, such as promoters, coding 
sequences (transcripts), upstream gene regions, down- 
stream gene regions, and intergenic regions. The signifi- 
cant enrichment of SUMO paralogues in promoter 
regions (Figure 2D and 2E) strongly suggests that 
SUMOylation may be involved in regulating gene tran- 
scription. Consistent with previous reports from lower 
eukaryotics and another describing SUMO-1 in HeLa 
cells [7,46], the correlation between SUMO paralogues 
binding to promoter region and higher levels of gene 
transcription, which is also found in the present study 
(Figure 10), further supports the potential role of 
SUMOylation in maintaining the expression of constitu- 
tively active genes. Moreover, SUMO-1 and SUMO-2/3 
may function in a similar manner maintaining the expres- 
sion of transcriptional active genes in non-reactivated con- 
trol cells. 
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Figure 14 SUMO-2/3 enrichment in stabilizing gene transcriptional during KSHV reactivation. (A) TREx-F3H3-K-Rta-shSUMO-2/3 BCBL-1 
cells were treated with Dox for 24 and 48 hours. TCLs were analyzed by immunoblotting using anti-SUMO-2/3 antibody. (B) Twelve IRF-1, IRF-2 and IRF-7 
targeted genes showing SUMO-2/3 enrichment at the promoter region during KSHV reactivation were chosen. Two genes showing no SUMO-2/3 
enrichment at the promoter region were chosen as control. RNA samples derived from TREx-F3H3-K-Rta BCBL-1 and TREx-F3H3-K-Rta-shSUMO-2/3 
BCBL-1 cells before and after 24 hours of Dox induction were subjected to reverse transcription (RT) reaction. Following the RT reaction, the IRF target 
genes were amplified by qPCR using gene-specific primer sets. All reactions were run in triplicate and normalized against GAPDH. 



SUMO binding onto chromatin must occur via either 
the modification of chromatin remodeUng proteins or 
the modification of transcription factors, both of which 
bind to the genome. SUMO shows focal peaks or areas 
of high occupancy within the promoter region near 
TSSs. The focal and gene-selective nature of SUMO oc- 
cupancy resembles the peaks associated with transcrip- 
tion factors, which suggests that there is SUMO 
modification of TFs. Motif scanning is a powerful 
method to facilitate the identification of DNA binding 
motifs (or transcription factor binding motifs) from 
peaks defined by ChlP-seq. This method has been widely 
used to distinguish the transcription regulation of one or 
a few TFs. SUMO modifications are able to occur in 
many dozens of known TFs as well as being likely to 
occur in many currently unknown TFs. Using the 
current findings, it is probably too complex and too time 
consuming to carrying full scale motif scanning to 



identify potential SUMO target TFs. Therefore, as an al- 
ternative, we used an annotation method that directly 
annotates SUMO peaks in the promoter region in rela- 
tion to transcription factor binding sites (TFBS). The de- 
tails of this method have been submitted in another 
article [54]. Briefly, the Transfac Matrix Database (v7.0) 
created by Biobase contains 258 TFBS weight matrices 
representing the potential DNA binding sites of 176 TFs 
and this database was chosen to annotate the SUMO 
peaks. Using this method, we were able to simultan- 
eously identify potential SUMOylated TFs. Potential 
SUMO target TFs are those TFBSs that show a signifi- 
cant correlation with SUMO peaks and these were iden- 
tified by the Hampel Identifier. Half of all SUMO-1 and 
half of the top-20 SUMO-2/3 potential target TFs identi- 
fied before and after viral reactivation were known 
SUMO targets. The other half may be potential SUMO 
targets that have not been identified as yet or proteins 



Table 5 GO categories of SUMO targeting genes during KSHV reactivation 
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IL-8 signaling 


205 


29 


1.52E-01 


28 


1.61E-01 


29 


1.25 E-01 


48 


4.68E-02 




NF-kB activation by viruses 


82 






12 


1.99E-01 






23 


2.00E-02 




p38 IVIAPK signaling 


117 


22 


4.47E-02 


18 


2.10E-01 


17 


3.10E-01 


26 


3.80E-01 




Toll-like receptor signaling 


62 


14 


1.05E-02 


9 


2.55E-01 






17 


5.75E-02 


Cytokine signaling 


IL-17A signaling in airway cells 


72 


9 


4.33E-01 


12 


8.91 E-02 


10 


2.72E-01 


21 


1.35E-02 




Prolactin signaling 


80 


10 


4.78E-01 


13 


1.21E-01 






22 


3.72E-02 


Cancer 


Glioma signaling 


112 


17 


1.03E-01 


14 


3.01 E-01 






29 


1.55E-02 




p53 signaling 


96 


18 


5.13E-02 


16 


1.17E-01 


16 


1.24E-01 


32 


1 .48E-03 




Small cell lung cancer signaling 


89 


13 


1.24E-01 


13 


1.02E-01 


13 


1 .08E-01 


27 


6.03E-04 




Sonic hedgehog signaling 


33 


6 


1.55E-01 


9 


7.24E-03 


5 


3.03E-01 






Cellular growth, proliferation 


Cleavage and polyadenylation of Pre-mRNA 


12 


5 


1.32E-02 


4 


5.37E-02 


3 


1.87E-01 


5 


8.32E-02 


and development 


EGF signaling 


62 




1.02E-01 


8 


4.05E-01 






21 


2.88E-03 




FAK signaling 


101 






15 


1.15E-01 


12 


4.06E-01 


27 


1 .32E-02 




JAK/Stat signaling 


70 


13 


8.71 E-02 


13 


7.08E-02 


10 


3.40E-01 


26 


5.37E-04 




Oncostatin M signaling 


35 


5 


4.55E-01 


5 


4.26E-01 






12 


3.S5E-02 




PDGF signaling 


85 


12 


2.92E-01 


12 


2.54E-01 






25 


1 .07E-02 




Thrombopoietin signaling 


63 


8 


4.22E-01 


10 


1.45E-01 






17 


4.90E-02 




VEGF signaling 


103 


17 


7.59E-02 


19 


1.66E-02 


13 


3.66E-01 


25 


7.59E-02 


Cell cycle regulation 


Cyclins and cell cycle regulation 


89 


14 


1.38E-01 


11 


4.05E-01 


15 


6.92E-02 


24 


2.75E-02 




GADD45 signaling 


22 


4 


2.26E-01 


3 


4.33E-01 


5 


8.32E-02 


9 


8.51 E-03 




Mitotic roles of polo-like kinase 


70 




2.46E-01 




2.14E-01 


14 


3.89E-02 


18 


1.39E-01 



Table 5 GO categories of SUMO targeting genes during KSHV reactivation (Continued) 



Growth factor signaling 


ErbB2-ErbB3 signaling 


60 


- 


- 


11 


9.55E-02 


- 


- 


19 


1 .82E-02 


Apoptosis 


April mediated signaling 


43 


8 


1.10E-01 


6 


3.40E-01 


8 


1 .OOE-01 


13 


3.72E-02 




Aryl hydrocarbon receptor signaling 


161 


24 


8.51 E-02 


19 


3.81 E-01 


23 


1 .07E-01 


40 


1 .48E-02 




IVIyc mediated apoptosis signaling 


60 


10 


2.13E-01 


10 


1.85E-01 


9 


3.10E-01 


20 


1.05E-02 


Cellular stress and injury 


Endoplasmic reticulum stress pathway 


18 


7 


5.25E-03 


5 


6.61 E-02 


5 


6.76E-02 


7 


6.03E-02 


Disease-specific pathways 


Parkinson's signaling 


16 


3 


3.44E-01 


4 


1.31 E-01 


5 


4.27E-02 


4 


4.32E-01 


Neurotransmitters and other 


Cholecystokinin/gastrin-mediated signaling 


106 


21 


1.82E-02 


13 


5.05E-01 


15 


2.94E-01 


24 


2.59E-01 



nervous system signaling 



bold: with statistical significance. 



Table 6 GO categories of SUMO targeting genes with no expression changes during KSHV reactivation 



SUMO-2/3 12 h SUMO-2/3 enriched at 12 hr 



All genes Gene expression no change Gene expression no change 





Pathway category 


Genes in 
category 


Genes 
identif i ed 


p-value 


Genes 
identif i ed 


p-value 


Genes 
identified 


p-value 


PsthoQsn-influsncBd signslinQ 


Virus entry via endocytic pathways 


99 


31 


1.41 E-03 


28 


3.24E-03 


20 


5.13E-03 


Cellular immune response 


Antigen Presentation pathway 


40 


14 


1.26E-02 


12 


3.63E-02 


g 


3.24E-02 




CD40 signaling 


70 


23 


2.82E-03 


21 


4.79E-03 


14 


1 .91 E-02 




HMGBl signa ing 


99 


25 


9.33E-02 


24 


646 E-02 


16 


1.03E-01 




IL-1 0 signaling 


78 


19 


9.55E-02 


16 


2.07E-01 


1 1 


2.10E-01 




IL-1 5 signaling 


67 


20 


1.86E-02 


19 


1.55E-02 


16 


2.34E-03 




IL-3 signaling 


74 


23 


1 .41 E-02 


22 


1 .02 E-02 


14 


4.90E-02 




IL-8 signaling 


205 


48 


4.68E-02 


44 


6.92 E-02 


27 


2.14E-01 




NF~kB activation by viruses 


82 


23 


2.00E-02 


21 


3.16E-02 


15 


3.39E-02 




p38 MARK signaling 


117 


26 


3.80E-01 


24 


3.88E-01 


16 


3.79E-01 




Toll-like receptor signaling 


62 


1 7 


5. 75 E-02 


15 


9.5 5 E-02 


1 1 


7.76E-02 


Cytokine sic|nalinc| 


IL-17A signaling in airway cells 


72 


21 


1.35E-02 


2 


8.51 E-02 


1 1 


1.58E-01 




Prolactin signaling 


80 


22 


3.72E-02 


22 


1 .45 E-02 


17 


6.46E-03 


Cancer 


Glioma signaling 


112 


29 


1.55E-02 


27 


1.62 E-02 


19 


2.14E-02 




p53 signaling 


96 


32 


1.48 E-03 


27 


1.23E-02 


20 


8.51 E-03 




Small cell lung cancer signaling 


89 


27 


6.03E-04 


26 


3.63E-04 


20 


2.57E-04 


Cellular growth, proliferation and development 


Cleavage and polyadenylation of pre-rnRNA 


12 


5 


8.32E-02 


5 


6.03 E-02 


4 


5.01 E-02 




EGF signaling 


62 


21 


2.88E-03 


20 


2.34E-03 


13 


1 .66E-02 




FAK signaling 


101 


27 


1.32E-02 


26 


9.33E-03 


17 


3.47E-02 




JAK/Stat signaling 


70 


25 


5.37E-04 


24 


8.71 E-04 


20 


1 .07E-04 




Oncostatin M signaling 


35 


12 


3.55E-02 


1 1 


4.47E-02 


8 


5.01 E-02 




PDGF signaling 


85 


25 


1.07E-02 


24 


6. 92 E-03 


18 


5.01 E-03 




Thrombopoietin signa ing 


63 


17 


4.90E-02 


17 


2.29E-02 


12 


3.31 E-02 




VEGF signaling 


103 


25 


7.59E-02 


25 


3.47E-02 


16 


9.55E-02 


Cell cycle regulation 


Cyclins and cell cycle regu ation 


89 


24 


2.75E-02 


23 


2.19E-02 


13 


1.80E-01 




GADD45 signaling 


22 


9 


8.51 E-03 


8 


2.40E-02 


4 


2.26E-01 




Mitotic roles of polo-like kinase 


/u 












4.44t-U 1 


Growth Ffactor signaling 


ErbB2-ErbB3 signaling 


60 


19 


1.82E-02 


19 


7.08E-03 


15 


3.02E-03 


Apoptosis 


April mediated signaling 


43 


13 


3.72E-02 


11 


9.12E-02 


6 


3.24E-01 




Aryl hydrocarbon receptor signaling 


161 


40 


1.48E-02 


34 


6.92E-02 


21 


1.96E-01 




Myc mediated apoptosis signaling 


60 


20 


1.05E-02 


19 


8.71 E-03 


14 


9.33E-03 


Cellular stress and injury 


Endoplasmic reticulum stress pathway 


18 


7 


6.03E-02 


7 


3.98E-02 


4 


1.73E-01 


Disease-specific pathways 


Parkinson's signaling 


16 


4 


432E-01 


4 


3.61 E-01 






Neurotransmitters and other nervous system signaling 


Cholecystokinin/gastrin-mediated signaling 


106 


24 


2.59E-01 


20 


4.56E-01 







bold: with statistical significance. 
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containing a SIM domain that provides an additional 
interaction platform allowing the recruiting of other 
SUMOylated proteins; both of these situations may be 
responsible for the TFs identified here. The SUMOyla- 
tion fraction in a steady state is typically very little in re- 
lated to the entire pool of transcription factors. Efforts 
are still needed to confirm the results outlined here and 
to elucidate the underlying functions of SUMOylation 
during the regulation of these TFs. Interestingly, when 
we ranked the potential SUMO-2/3 target TFs by the 
total number of their regulating genes (Figures 4 and 5), 
we found three IRFs that were not SUMO-2/3 targets in 
the control cells that were listed as top 4th, top 5th and 
top 6th of the SUMO-2/3 target TFs after viral reactiva- 
tion. IRFs constitute a family of TFs (IRF-l-IRF-9) that 
are in control of the type I interferon (IFN) system and 
are involved in executing the innate and adaptive im- 
munity associated with host resistance against patho- 
gens, including virus infection. To promote its own 
survival, KSHV exploits a number of different strategies 
to suppress the host immune system. Recent evidence 
has shown that the virus triggers the SUMOylation of 
IRFs, leading to a targeting and blocking of the type I 
interferon pathway [24,40,41]. K-bZIP of KSHV has also 
been found to inhibit type I IFN signaling in a signal 
transducers and activators of transcription (STAT) 
dependent manner and in an IFN-stimulated gene factor 
3 (ISGF3) independent manner [39]. Moreover, KSHV 
K-bZIP inhibits IRF-3 by preventing IRF-3 from binding 
to target promoter, which precludes the formation of the 
enhanceosome. The potential SUMO-2/3 target IRFs 
identified here (Figure 5) provides an additional novel 
mechanism for globally inhibiting the activation of the 
host immune system. 

The growing links between the viral and cellular SUMO 
systems makes SUMO a potential target for antiviral ther- 
apy [21]. Identifying the preferential usage of SUMO para- 
logues in viruses may help to improve the specificity of 
any SUMO-targeted antiviral therapies. Recently, growing 
evidence, including ours, suggests that some herpesviruses 
have a preference for SUMO-2/3 [16,55]. Significant in- 
crease in SUMO-2/3 coating across human genome, but 
not in SUMO-1 coating, during viral reactivation found 
here suggest that a new class of combine therapy targeting 
SUMO-2/3 may disrupt the dynamic balance of the her- 
pesvirus latent and lytic phases. Disrupting the balance 
may help the clearance of the herpesvirus from the in- 
fected cells and improve current therapy. 

In summary, we found that SUMO-1 and SUMO-2/3 
share a highly similar binding landscape on chromatin. 
They are preferentially enriched in promoter regions and 
are associated with highly transcribed genes. Differential 
chromatin-binding profiles of the SUMO paralogues are 
able to be observed during herpesvirus reactivation. We 



found that SUMO-2/3 peaks significantly increased in 
promoter regions during viral reactivation and this was as- 
sociated with the genes that do not undergo changes in 
transcription level. TFs identification and GO analysis sug- 
gests that SUMO-2/3 preferentially target immune path- 
ways during viral reactivation. 
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